Learning High-Density Regions for a Generalized Kolmogorov-Smirnov Test in High-Dimensional Data
Abstract
We propose an efficient, generalized, nonparametric Kolmogorov-Smirnov test for detecting distributional change in high-dimensional data. To implement the test, we introduce a novel hierarchical minimum-volume sets estimator to represent the distributions to be tested. Our work is motivated by the need to detect changes in data streams, and the test is especially efficient in this context. We provide the theoretical foundations of our test and show its superiority over existing methods.
Similar resources
High-Dimensional Unsupervised Active Learning Method
In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...
Feasibility of using statistical tests in evaluation of non-uniformity [Persian]
Introduction: Non-uniformity test is essentially the only required daily QC procedure in nuclear medicine practice. Noise creates statistical variation or random error in a flood image. Non-uniformity on the other hand does not have statistical nature and may be regarded as systemic error. The present methods of non-uniformity calculation do not distinguish between these two types of erro...
A Kolmogorov-Smirnov Correlation-Based Filter for Microarray Data
A filter algorithm using F-measure has been used with feature redundancy removal based on the Kolmogorov-Smirnov (KS) test for rough equality of statistical distributions. As a result, a computationally efficient K-S Correlation-Based Selection algorithm has been developed and tested on three high-dimensional microarray datasets using four types of classifiers. Results are quite encouraging and sev...
Small Improvement to the Kolmogorov-Smirnov Test
The Kolmogorov-Smirnov (K-S) test is widely used as a goodness-of-fit test. This thesis consists of two parts to describe ways to improve the classical K-S test in both 1-dimensional and 2-dimensional data. The first part is about how to improve the accuracy of the classical K-S goodness-of-fit test in 1-dimensional data. We replace the p-values estimated by the asymptotic distribution with nea...
Investigating the challenges of teaching and learning Arabic in the high schools of Zabol County
Purpose: This paper aims to evaluate the teaching and learning processes of the Arabic course in the high schools of Zabol County. Methodology: Descriptive-correlational method was applied as the research method and the statistical population was comprised of two groups – students and teachers of Arabic course. It had a practical aim and relies on the general hypothesis stating that the Arabic ...
Publication date: 2012